Secondary Structural Analysis of Families of Protein Sequences using Chaos Game Representation
نویسندگان
چکیده
CGR is an effective method for visualizing any structural features if it is given as a sequence of elements [1,2] analyzed by the genomic signature appears as a powerful tool for investigating the mechanisms of DNA maintenance from which the DNA structure results. It would be necessary to understand the patterns they exhibit and to be able to interpret them in a biologically meaningful way [3]. All informational macromolecules of biological interest are linear polymers. The subsequences of a genome exhibit the main characteristics of the whole genome, attesting to the validity of the genomic signature concept [2]. A great extent concentration has newly been focused on analyzing the biological sequences of both Deoxyribo Nucleic Acid (DNA) and proteins using the patterns observed in their graphical representations [1-9] and mathematical descriptions. These studies have shown applications in chemo informatics and bioinformatics. Chaos Game Representation (CGR) for gene (or DNA) sequences was introduced by [4,5] the underlying structures of genome sequences of a few model organisms that were obtained using CGR plots. CGR also offers new possibilities to resolve scale dependencies for information content in sequences [7].
منابع مشابه
An Artificial Neural Network Classifier for the Prediction of Protein Structural Classes
As there are quite a few difficulties for us to predict a protein structural class directly from its primary sequence, the protein structural prediction based on the predicted secondary structure will undoubtedly be the first choice we would like to take. Protein structural classes are generally defined as four classes: α, β, α/β, α +β. The protein secondary structure describes the local struct...
متن کاملRelation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
متن کاملPrediction of protein-protein interactions using chaos game representation and wavelet transform via the random forest algorithm.
Studying the network of protein-protein interactions (PPIs) will provide valuable insights into the inner workings of cells. It is vitally important to develop an automated, high-throughput tool that efficiently predicts protein-protein interactions. This study proposes a new model for PPI prediction based on the concept of chaos game representation and the wavelet transform, which means that a...
متن کاملAnalysis of genomic sequences by Chaos Game Representation
MOTIVATION Chaos Game Representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, in order to find the coordinates for their position in a continuous space. This distribution of positions has two properties: it is unique, and the source sequence can be recovered from the coordinates such that distan...
متن کاملEncoding DNA sequences by integer chaos game representation
Motivation: DNA sequences are fundamental for encoding genetic information. The genetic information may be understood not only by symbolic sequences but also from the hidden signals inside the sequences. The symbolic sequences need to be transformed into numerical sequences so the hidden signals can be revealed by signal processing techniques. All current transformation methods encode DNA seque...
متن کامل